On Optimal Pipeline Processing in Parallel Query
نویسندگان
چکیده
A key assumption underlying query optimization schemes for parallel processing is that their cost models can deal with the multitude of eeects encountered during the execution phase. Unfortunately, this is rarely the case and the optimal processing is only achieved in a few situations. In this paper we address the problem to achieve optimal processing under a pipelined execution strategy. The approach taken is based on a novel analytical framework|which establishes a formal treatment of both dataaow and processing environment|to validate execution strategies. The framework is based on the notion of-optimality which reeects an execution strategy's ability of ad-hoc resource utilization.-optimal strategies are insensitive to skew and provide a transparent interface to parallelism as they ensure a provable near-optimal exploitation of the processing environment. Finally, we discuss several strategies and present a-optimal execution strategy. Experiments carried out on an SMP verify our considerations: The new algorithm outperforms conventional pipelining execution substantially and is resistant against various kinds of skew.
منابع مشابه
On Optimal Pipeline Processing in Parallel Query Execution
A key assumption underlying query optimization schemes for parallel processing is that their cost models can deal with the multitude of e ects encountered during the execution phase. Unfortunately, this is rarely the case and the optimal processing is only achieved in a few situations. In this paper we address the problem to achieve optimal processing under a pipelined execution strategy. The a...
متن کاملParallel Query Processing
With relations growing larger and queries becoming more complex, parallel query processing is an increasingly attractive option for improving the performance of database systems. The objective of this paper is to examine the various issues encountered in parallel query processing and the techniques available for addressing these issues. The focus of the paper is on the join operation with both ...
متن کاملStochastic Heuristic Optimization based Multi-Query Processing in Wireless Sensor Network using Genetic Algorithm
Wireless Sensor Network is an infrastructure comprising of sensing, and computing. The communication elements in sensor network give capability to instrument, watch, and respond to events and phenomenon in a particular situation. Query processing in sensor network first transfers the query generated position to the node where the result obtained is similar to this query. Collaborative Query-Cen...
متن کاملParallelizing Skyline Queries for Scalable Distribution
Skyline queries help users make intelligent decisions over complex data, where different and often conflicting criteria are considered. Current skyline computation methods are restricted to centralized query processors, limiting scalability and imposing a single point of failure. In this paper, we address the problem of parallelizing skyline query execution over a large number of machines by le...
متن کاملLeveraging Non-Uniform Resources for Parallel Query Processing
Clusters are now composed of non-uniform nodes with different CPUs, disks or network cards so that customers can adapt the cluster configuration to the changing technologies and to their changing needs. Such modular clusters challenge parallel databases. The load balancing techniques used by existing parallel databases partition data across a set of nodes that all run the same relational operat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998